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^_^ ' Abstract: We develop a test to determine whether a function lying in a 

Cn ' fixed L2— Sobolev-type ball of smoothness i, and generating a noisy signal, 

5, is in fact of a given smoothness s > 4 or not. While it is impossible to 

Q.. construct a uniformly consistent test for this problem on every function of 

^^ ■ smoothness t, it becomes possible if we remove a sufficiently large region 

^~N ' of the set of functions of smoothness t. The functions that we remove are 

/■^v ' functions of smoothness strictly smaller than s, but that are very close to 

s— smooth functions. A lower bound on the size of this region has been 
proved to be of order n~*' (^*+^' -^'j and in this paper, we provide a test 
L^ ■ that is consistent after the removal of a region of such a size. Even though 

the null hypothesis is composite, the size of the region we remove does not 
depend on the complexity of the null hypothesis. 



Keywords and phrases: Non-parametric composite testing problem, Min- 
C^ ' imax bounds. Functional analysis. 

1. Introduction 

^vj , We consider in this paper a composite testing problein in the non-parametric 

Q>^ ' Gaussian regression setting. Assuming that the unknown regression function / 

10 ! lies in a given smoothness class (indexed by t), we want to decide whether / 

Cn ■ is in fact in a much more regular class (indexed by s > t), by constructing a 

^^ I suitable test. More precisely we consider the setting of testing between two fixed 

^D ■ L2 Sobolev-type classes, which we define formally in Section 2 below. 

^n I Let Y,{t, B) be the ^2— Sobolev-type ball of functions in [0, 1] of smoothness t 

and radius B, and let I](s, B) with s > i be a sub-model (i.e. I](s, B) C S(i, B)). 

We assume that we have observations generated according to a Gaussian non- 
k^ ■ parametric regression model with underlying function /, at noise level n, where 

^; /GE(s,S)or/eE(i,i?)\E(s,i?). 

C^ ■ For G C L2, set ||/ - G||2 = infgec 11/ - sib- We define for pn > the sets 

E(t,S,p„) = {,/eE(t,S):||/-E(5,B)||2>p„}. 

Note that these sets are separated away from Yi{s,B) whenever p„ > 0. They 
correspond to E(i, B) \ E(s, B) where we have removed some critical functions, 
very close to functions in Ti{s,B). 

We are interested in the composite testing problem: 

Ho:feJ:is,B) vs. H^:fet{t,B,p^). (1.1) 

1 
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More precisely, we want to know the minimal order of magnitude of p„ that 
enables the construction of a uniformly consistent test ^n between Hq and Hi, 
i.e. of a test such that for some TV that depends on TJo, Hi and a only, it holds 
for any n > N that. 



sup E/*„ + sup E/ 1 - *„ < a. 
feHo feHi ^ ' 

Two topics that are closely related to this question have been thoroughly 
studied. The first one is nonparametric signal detection where i^o = {0}. The 
second is the creation of adaptive and honest nonparametric confidence bands 
around functions. 

Let us first recall the results obtained in signal detection where one wishes 
to test 



H. 



/ = vs. i/i:/e{/eS(t,B):||/-0i|2>p„}. (1.2) 

As in any testing problem, in order to obtain a uniformly consistent test, the 
model has to be restricted such that the elements in Hq are not too close to the 
ones in H\. This explains the presence of the separation sequence p„. Ingster 
(1987, 1993); Spokoiny (1996); Ingster and Suslina (2002) prove that the min- 
imal order of p„ that enables the existence of a consistent test in the above 
problem is 

Pn > Dn'^^^^Ts _ 

For pn of this order, the authors also build a consistent test for the problem (1.2). 
They prove that the testing problem is equivalent to testing whether the sum of 
the squares of the means of independent (or close to independent) sub-Gaussian 
random variables is null or not, and the usual theory of the Chi-squared test 
applies. The size p„ of the separation area is related to the minimax rate of 
estimation of the L2 norm of / under the alternative hypothesis. A question 
that arises is how the results change when the null hypothesis is a composite 
hypothesis, in our case an infinite dimensional Sobolev-type ball. 

The testing problem described in Equation (1.1) is also closely related to 
the problem of the creation of confidence bands around functions - see for 
instance Hoffmann and Lepski (2002); Juditsky and Lambert-Lacroix (2003); 
Hoffmann and Nickl (2011); Bull and Nickl (2013) where this relation is made 
clear. Despite the fact that there exists a quite complete and satisfying theory 
for adaptive nonparametric estimation - sec e.g. Lepski (1992); Donoho ct al. 
(1996); Barron et al. (1999); Tsybakov (2003) - the theory of honest and adap- 
tive confidence sets has some fundamental limitations. Indeed, one has to re- 
move critical regions from the parameter space in order to construct honest and 
adaptive confidence sets, see Low (1997); Cai and Low (2004); Gine and Nickl 
(2010); Hoffmann and Nickl (2011); Nickl (2013); Bull and Nickl (2013). In the 
recent paper Bull and Nickl (2013), the problem of L2-honest and adaptive con- 
fidence sets is considered and in the course of the proofs, the authors establish 
that in the testing problem (1.1), Pn can be taken of the order 

/9„>L)max(n 2t+i/2^j^ ^''+^), 
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for D large enough depending on the level of the test and on s, t. On the other 
hand, they prove in the case of density estimation (we provide a proof of this 
fact in our setting, see Theorem 3.2 below) that the lower bound for yO„ is 

Pn > ZJ'rt" 2t+l/2 ^ 

for some D' positive; otherwise there exists no consistent test for the prob- 
lem (1.1). In the case s < 2t, the upper and lower bound do not match (which 
in their context of confidence sets is unimportant, for related results see Baraud 
(2004); Cai and Low (2006); Robins and Van Der Vaart (2006); BuU and Nickl 
(2013)). 

From the point of view of hypothesis testing, the case s < 2i is in fact of 
particular interest, as it implicitly addresses the question whether the complex- 
ity of the null hypothesis should influence the separation rate in non-parametric 
composite testing problems. When s > 2t, the rate of estimation in the null 
hypothesis is much smaller than the separation rate, and a reduction to a sin- 
gleton null hypothesis is intuitively speaking always possible as shown by the 
infimum test considered in Bull and Nickl (2013). For s < 2t, the infimum test 
cannot be expected to work for p„ smaller than ?7~''/(2s+i)^ go new ideas seem 
to be required. 

In this paper, we demonstrate that the complexity of the null hypothesis does 
not influence the separation rate in the testing problem (1.1). More precisely, 
we prove that it is possible to build a test that is uniformly consistent with a 
separation rate 

The test we propose uses the geometric structure of the Sobolev-type balls 
combined with a simple multiple testing idea based on Chi-square statistics, 
and is straightforward to implement. In this paper, the Sobolev-type spaces we 
consider are the Br,2,oo Besov spaces, and the fact that the third index g = oo 
is important. 

Section 2 formalises the setting and notations that we consider. Section 3 
provides the test and the main Theorems. Proofs are given in Section 4 and 5. 

2. Setting 

Denote by L2([0,l]) = L2 the space of functions defined on [0,1] such that 
II/II2 — /n \I{^)\'^dx < 4-00; where ||.||2 is the usual L2 norm. For any functions 
(/1.9) •= L2, we consider the usual scalar product (/, .g) = /q f{x)g{x)dx. 

2.1. Wavelet basis 

Let Jo € N. We consider an orthonormal and compactly supported wavelet basis 

{(l)k,ke Zj^,iJi,k,l > JoJe N,k e Zi}, 
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such that VZ > Jq.Zi <zZ,yi> Jo + 1, \Zi\ = 2', \Zjg\ < oo, and where as usual 
■tPij. = 2'/^7/'fe(2'.). Also we assume that V/ > Jo + l,Vfc G Zi,J^ i}ji^k{x)dx = 0. 
We write Vfc S Zjg,ipj^,k = i'k in order to simplify notations. Such a basis 
exists (for instance the Cohen-Daubechies-Vial basis verifies all these conditions, 
see Cohen et al. (1993)). 

For any function f G L2, we consider the sequence a(/) = a of coefficients 
such that yi > Jo,yk e Zi, 

at.k = / i^i,k{x)f{x)dx = (V'/.A;,/)- 
"'0 

The functions f £ L2 have the representation 

/ == X! X! V'i,fc(V'i,fc, /) = X! X! °-i,k^i,k- (2.1) 

l>Jo kGZi l>Jo keZi 

We moreover write for any J > Jq 

Jo<i<JkeZi 
the projection of / onto spa.n{ipi,k, Jo < ' < >^i ^ G Zi). We also write 

^Wj if) = X! '^J.k'^J.k 
keZj 

the projection of / onto span{ip j,k, k e Zj). 

2.2. Besov spaces 

We consider the Besov norms 

||/||..P,oo-snp(2'(-^+^/2-Vp)|(/,^,_.)|,^), 

l>Ja ^ ' 

where |m|; == (^^ u^)^'^ is the sequential l^ norm. 
The Besov spaces are defined as 

B,,p,oo = {! (^V : !l/iU,p,oo < +00}. 

We write for a given r > and a given i? > the i?r,2,oo Besov ball of 
smoothness r and radius i? as 

S(r,B):={/eB,,2,oo:|l/|lr,2,oo<S}. 

For regular enough wavelets (e.g. Cohen-Daubechies-Vial wavelets with S 
first null moments) , the defined Besov spaces correspond to the functional Besov 
spaces (Sobolev-type spaces) up to some smoothness S > s, see Meyer (1992); 
Hardle et al. (1998). We assume that our basis verifies this property with s < 
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S where s is the largest smoothness that we wish to consider in our testing 
problem. 

The spaces -Br-,2,oo are slightly larger than the usual Sobolev spaces, see Bergh and Lofstrom 
(1976); Besov et al. (1978). They are however the natural objects to consider 
for a smoothness test, since they are the largest Besov spaces where adaptive 
estimation remains possible (see Donoho et al. (1996); Bull and Nickl (2013)). 
Indeed, one can prove that there exists an estimate fniX'^^^) of / such that for 
any S >r > 1/2 and _B > 0, we have 

sup E||/„-/||2<0(n-'V(2'-+i)), 

/eE(r.S) 

see for instance Theorem 2 in the paper Bull and Nickl (2013) (with some sim- 
ple modifications needed for the regression situation considered in the present 
paper) . 

2.3. Observation scheme 

The data is a realisation of a Gaussian process defined for any x G [0,1] and for 
a given n as 

dr(")(x-) = /(x)cix + ^, 

where {Bx)x(^[i),i\ is a standard Brownian motion, and f (z L2 is the function of 
interest. 

Let us write for any I > Jq and k £ Zi the associated wavelet coefficients as 

ai^k^ {i^i.k^dY'''^^), and ai^k ^ {ipi,k, f)- 

We consider the wavelet estimate of /: 

/n = ^ y^^ai,kipi,k- 

l>.Ja k 

This estimate is of infinite variance in L2, hence projected estimates 

fn{j) :=ny^./„, 

have to be considered. 

In the sequel, we write Pr^ (respectively Ey) the probability (respectively 
expectation) under the law of dY^"^' when the function underlying the data is 
/. When no confusion is likely to arise, we write simply Pr (respectively E). 

3. Testing problem 

3.1. Formulation of the testing problem 

Let s > t. We want to test whether / is in S](s, B), or whether / is outside this 
ball, i.e. in Y,{t,B) \ S](s,i?). This is generally impossible to do uniformly and 
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functions that arc t smooth but too close from s smooth functions (such that the 
-L2 distance between these functions and the Sobolev-type ball of smoothness s 
is smaller than p„) have to be removed. 

Let us first define the restriction of the sets S(t, B) to sets that are separated 
away from T,[s,B) by some minimal distance p„: 

E(i, B, p„) = {/eS(i,i3): 11/ -S(s, 5)112 >p„}. 
The testing problem is the following. 

Ho:fe^{s,B) vs. ifi:/eE(t,B,p„). 

3.2. Main results 

Let j > Jo such that 2^' w ni/(2t+i/2)^ 

Consider for any Jq < I < j the test statistics 

r„(0 = ||nv^,/„||2-|. (3.1) 

These quantities Tn{l) are estimates of \\llwi (Z)!!! across all levels Jo < I < j. 
Concerning levels I > j , even in the worst case of smoothness t, the L2 norm 
of the function at these levels is smaller than n-*/(2t+i/2)^ -^^ 11/ — Hy. (/)|l2 = 
0(n~*/^^*"'"^/^^). This implies that one does not need to control for what happens 
at these levels. 

Let a > be the desired level of the test. Consider positive constants i„(0 
for any Jq < I < j . Set 

.,,2 fB r;\2 B^ B rf 

for a sequence (Ti)jg<i<j and a constant C such that 

2O+O/8 fc^ 

n = Tn,i=C ^— , and C>8\ — , 

Vn V a 

where C" > 1 is a universal constant. 
We consider the test: 

*„(a) = l- H 1{T„(0 < i„(/)'}, 

Jo<l<3 

that is, we reject Hq as soon as the test statistic at one of the levels Jq < I < 
j indicates a too large Besov norm. The intuition behind this test is that / 
belonging to T,{s,B) is equivalent to ||nvi(/)||s^2,oo being smaller than or equal 
to B for any I > Jq. As explained before, we do not need to be too concerned by 
what happens for / > j. In the case Jo < I < j, each statistic Tn{l) is designed 
to test this. We illustrate this in Figure 1. 

We provide the following definition of consistency for a test, following the 
line of work of Ingster and Suslina (2002). 
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'Z(s,B) ..-. Confidence zone in ^(s,B) for 

C .) riwif and thus Tn(l): ball of 
'Z(t,B) radius tn(l) 





NJo NJo + l 

Fig 1. Illustration of the testing problem. 



(V) Q 



l=j 



Definition 3.1 (a— consistency). Let a > and Hq,Hi be two hypotheses 
(functional sets). Let ^„(F'^"',iJo, -ffi,Q;) be a test, that is to say a measurable 
function taking values in {0, 1}. We say that VPn is a— consistent if we have for 
any n > 

sup E/*„ + sup E/(l - ^n) < a. 

f<£Ho feHi 

We now state the main result of this paper. 

Theorem 3.1. Let a > 0. The test ^„(a) is an a— consistent test for discrim- 
inating between Hq and Hi and for p„ = (7(0)71^*' '^'^-^ "' , where C{a) is such 

that C{a) > 16{B + 1)\ —, for C" > 1 some universal constant. 



The proof of this theorem is in Section 4. The region we had to remove so 
that \E'„(a) is a— consistent could not have been taken significantly smaller, as 
stated in the next Theorem. 

Theorem 3.2. Let 1 > a > 0. There exists no a-- consistent test for dis- 
criminating between Hq and Hi and for p„ == D{a)n~'^'^'^'^^^''^' , where D{a) < 



1I/4 



32 



B 



The proof of Theorem 3.2 is in Section 5. It is very similar to the proofs in 
papers Ingstcr (1987); Bull and Nickl (2013) (the proof in paper (Bull and Nickl, 
2013) holds in the more involved case of density estimation). 

We would like to emphasise that the test ^n, in addition to being rather 
simple conceptually, is quite easy to implement since it requires only the com- 
putation of (significantly) less than n integrals/sums - the empirical coefficients 
- and less than log(7i) sums of squares of these coefficients. It can replace the 
more complicated infimum test considered in the paper Bull and Nickl (2013) 
for the creation of adaptive and honest confidence bands. 
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3.3. Alternative settings 

We provided in the last Subsection a consistent test on a model that could not 
have been taken significantly larger. This test was constructed in the rather sim- 
plistic setting of non-parametric Gaussian homoscedastic regression with nor- 
malised variance. But in many cases (see e.g. Reifi (2008); Nussbaum (1996)), 
it has been proven that it generalises rather well to more realistic and com- 
plex settings. The concern in our case, however, is that we heavily rely on the 
homoscedasticity assumption with known variance of the noise. Indeed, we sub- 
stract the constant part induced by this variance in the estimates of Tn{l) in 
Equation (3.1). This part is much larger than the deviations (in high proba- 
bility) of IlIIvK, /„|j| around its mean, and it is thus crucial to remove it. We 
illustrate this in Figure 2. 

There is however a way around this problem that we discuss now, as well as 
generalizations to more complex settings. 

■ n„f„ with high probability 



^i+^,/4 y n f r,+ r:/4 




+ 



^wflL Elln tir =^ +||n fir 

I ^ "— II w, "II2 n II w, II2 

T„(l) with high probability || nj„|f with high probability 



Fig 2. Statistics T„{1) and the removal of the expectation of the square of the expectation of 
the noise. 



Heteroscedastic non-parametric Gaussian regression: Assume now that the 
data are generated according to the process 



dy(")(x) = /(x)dx 



a{x)dBx 



where (i?x)a;6[o,i] is a- standard Brownian motion, and /, cr G ^2- Since the 
function a is unknown, we can not apply the technique we described. However, 
if we know a upper bound on ||cr||2, it is still possible to solve this problem with 
a very similar technique. 

The modification goes as follows. We start by dividing the initial sample in two 
sub-samples of equal size n/2. Then we compute the empirical estimates of the 
function in these two samples and write /A and /„ for the estimates of the 
function computed in each of the two halves. We then define the statistics Tn{l) 
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(which play the same role as the Tn{l)) as 

f„(/) = (nu.j(i),nH.j(^)). (3.2) 

Since /A and fn are independent estimates of /, the additional term that 
comes from the expectation of the square of the noise (the variance) disappears 
and it is possible to prove that this newly defined T„(Z) concentrates around 
||nvi/,/||2 with an error of same order as in Lemma 4.2 below. This implies that 
we can test in a similar way and derive similar results. 

Regression, density estimation and autoregressive model: The settings of non- 
parametric regression (with noise that can be non- Gaussian), of non-parametric 
density estimation, and of non-parametric auto-regressive model {AR(l)) are 
not too different from the heterosccdastic setting under a given set of assump- 
tions (that e.g. the noise on the data is sub-Gaussian and that the design is 
adapted for regression, and that e.g. the regression function/density is in Loo, 
see (Bull and Nickl, 2013)). This follows from the asymptotic equivalence be- 
tween these models and non-parametric Gaussian regression (again, see e.g. Reil3 
(2008); Nussbaum (1996)). 

• In the regression setting, we assume that the n data are 

Y,^f{X,)+a{X,)e^, 

where e^ are independent random variables of mean and variance 1. 
Based on these data, we can compute also estimates for the wavelet coef- 
ficients of / as 

1 " 

n ^-^ 
i=i 

and thus estimate /. Then we can follow the procedure described in 
the setting of heterosccdastic non-parametric Gaussian regression (Equa- 
tion (3.2)). However, one needs to be careful in this setting since the design 
(i.e. position of the Xi) is crucial. Indeed, wavelets are very localised func- 
tions and estimating the wavelet coefficients in a reasonably accurate way 
requires that the points Xi are spread over the whole domain, that is to 
say that there are enough points in each region of the domain. In par- 
ticular, a standard random design will fail in this case, see Hardle et al. 
(1998). 

• In the density estimation setting, we assume that the n data generated by 
/ are {Xi)i, and estimate the wavelet coefficients of / as 



ai 



1 " 

,fc = - V-0Z,fc(XO: 
77, ^ ^ 



and thus estimate /. Then we can follow the procedure described in 
the setting of heterosccdastic non-parametric Gaussian regression (Equa- 
tion (3.2)). 
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• We consider finally the non-parametric autoregressive model with memory 
1 (or AR{\)). The output (Xi)i<„ of an AR{\) can be described as follows: 

X,+i = f{X,) + a{Xi)e,. 

After sub-sampling the data at random in order to make them close to 
independent, one can go back to the regression setting, and apply the 
same method (see e.g. (Hoffmann, 1999) for equivalence of this setting 
and regression setting after sub-sampling). 

4. Proof of Theorem 3.1 

This Section contains a proof of Theorem 3.1. 

4.1. Decomposition of the problem 

The statistics r„(l) are unbiased estimates of Ullvi^, (/)||2 for any Jo < I < j, as 
explained later in this Section. Assuming this, the next Lemma explains why 
the test \l/„ that we described is a reasonable thing to do. 

Lemma 4.1. Let {ti)j^-,<i<j be a sequence of positive real numbers. Assume that 
Pn > i4^B2-^* +4/'Sj2i<j~n)- Then we have 

• feHo^ maxj„<(<, (||nM/,(/)i|2 - ^) < 0. 

• f e Hi^ max,/o<;<j (i|nw,(./)i|2 - ^ - r;) > 0. 

Proof. Under the null Hypothesis Hq 

If / is in I](s, B) then by definition of the Besov spaces 

||ny^/||,,2,oo<B, 

which implies by definition of the ||.||o.2,oo norm that 

sup (^||nw,/||o,2,oo - ^j < 0. 

This implies by Parseval's identity, and by definition of W; (since |jnw,/||o2cc = 

mwjh) 

B \ /„„ „„ B 

^ ) = sup 



sup ( \v^wJh -^) = sup (l|nH'-J||o,2,oo - ^) < 0. 



Under the alternative Hypothesis Hi 

Assume that / is in 'E{t, B, p^). By triangular inequality we have 

inf ||/-,9||2< inf \\TlvAf) - gh + WTly-iDh 

3GS(.s,B) seS(s,B) 

< inf ||ny^.(/)-g||2 + B2--'"*, 
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where we set V^ = U; ^ \ ^'i ^-nd since by definition of tlie Besov spaces, we 
know tliat ||ny-/||2 < i?2^^*. We tlius liave, since p„ < ''irdg^Y.(s,B) ||/^,9||2 by 
definition of E(t, B, pn), 

3p„/4 < p„ - 52-^'* < inf ||ny^.(/)-.g||2. (4.1) 

Let us write {ai^k)i,k the coefficients of / and {bi_k)i.k the coefficients of the 
minimiser g. We have by definition of T,(s, B), by the triangular inequality and 
by Parseval's identity 

inf llny^ (/) - gh < inf ^^ ^ l|nw,(/) - dh 

geS{s,B) s6E(s,B) ,^ 

I— Jo 






J 



inf V" , y^io,i.k -bi.kY 

{bi,k)i,k:yi>Jo,2'^\\b,,.\\,^<Bj^ \l^ 



V inf JV(ai.fc-6,.fc)2 



;^^^(;'!,/=)i,fc:2'=||6,J|,,<i3' ^_ 

since the constraints defining the minimisation problems involved do not interact 
across the levels I. The last equation, together with Equation (4.1), implies that 



By definition, p„ > 4/3 ^;^j tj, so the last equation implies that 



V r, < V inf ^ y2iai,k - k^u?. (4.2) 



-Jo ^^j^(K.hx-2^^\\K-K<B\. , 



At least one oftherj's has to be less than or equal to inf (hj t); fc:2'=l[6, H,, <b \/^k^o,i.k — h,kY^ 
as otherwise X]/=j "^^ would exceed the right hand side in Equation (4.2). Let 
•^0 < ^ < j be one of these indexes, we have 



Ti < inf y^{ai,k - hi^kY 



<max(0, /;^a2,,-^ 

<l|nw-,(/)l|2-^. 

since ''miv£{w£L2,\\w\\-i<i} \\u — v\\2 = max(0, ||u||2 — 1) by definition of a L2— ball. 
This concludes the proof. D 
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4-2. Convergence tools for Tn{l) 

The next Lemma is a standard and also rather weak concentration inequality 
(see e.g. Birge (2001) for similar results). 

Lemma 4.2. Let A > 0. There exists an universal constant C" > 1 such that 



Pr {Wl ■.Jo<l< J, |r„(0 - lln^Jll^l > 2^^C'(^^+2'/4Mll|M)} 



< A. 



Proof. Let Jo <l <j- 

Note first that by Parseval's identity, we have |lnvi/-j/„||2 = J2k '^f k- Then we 
have by definition T„(Z) = ^^ a^j, — — . 

We have ai^k = a'i,k+&i,k — cii,k where a,i,k — ai^k is a A^(0, 1/n) (by Assumption 
of the Gaussian model), and thus we have 

E\ai^k\^ ^ - + al,. 
n 

Also since by Ingster and Suslina (2002) (Page 86) there exists a constant C" > 1 
such that Y{G + m)^ < C"(l + rn^) where to is a constant and G is a iV(0, 1), 
we have 









y\ai,k\^<C 


n'' n 






This 


implies 


since 


the aj.fc are independent Gaussian 


random 


variables 








keZi 


nl 

kh ' " 


; 




and 






kez, 


, 2' ^ Ekez^ 
n^ n 


7^ 





This implies by Chebyshev's inequality that for any Si > 0, we have 



and since \\TlwJn\\l = Ekez, ^Ik and ||nvv,/||2 = J^kez, '^Ik that 



Pr||l|nvyj„|li - - - E[|inH.,/„lli]| > \jC'{^ + MIElM)| < 5i. 

This implies by definition of T„(Z) 



Pr{|T„(0H|n..j||^|>Jic4 + HIlilM)} 
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which implies by an union bound over aU Jo < I < j, that we have 



13 



Pr {yi:Jo<i< J, |r„(0 - iin^^jii^l > J^c'(^ + B^^ 

Set for any Jo < / < j, Si = (2-(J~')/2 + 2-'/'')A/4. Then 
Pr{v/: Jo</<j, |T„(0-||nw',/| 
This conchides the proof. 



)} ^ E ^^ 

Jo<i<i 



~ *' A ^ n2 n • 



}^i: 



<5z<A. 



Jo<l<j 

D 



4-3. Study of the test 

Let A > be the desired level of the test. 

We recall that j is such that 2^ ~ yji/(2t+i/2) g^yj^^ ^^ g^,^ Jq,. g^j^y Jq < I < j 

the positive real numbers 

2(i+0/8 /^^ 

Ti = Tni ~ C z= — for C such that C > 8\ —r-. 

Vn V A 

We set 

p„ = 2C(S + l)n-*/(2*+i/2). 

In particular with this definition, 



p„ > CB2-^* + 4/3 J2 ^'■ 

Jo<(<i 

Set also for any Jq <l < j the positive real numbers 

B , \2 52 Q 2 



t, 



>(0^-(^ + V2)' 



22Z.- ^ 2' 



:Ti 



(4.3) 



^.5.-Z. Null Hypothesis 

Since / G I](s, i?), by Lemma 4.1, 



max 

Jo<l<3 



{mwj\ 



B , 

^-2^1^0- 
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Thus by Lemma 4.2, we have with probabihty at least 1 — A that for any 



V A n-^ 71 ' 



T„(/)<||nwj||^ + 2y-c'( 

<^(^ + 2x2V^/^)+2,/>^^^i:i^ 
- 2'H2''* y An J V A n 



B^ B A^ 20-+')/« , ^ AT-;2(^+^ 

^^ + VVa^ ^IV^ + ^V A^ ^^ 



-B „ /T7;2(J+')/8n2 



< hrT- + 2A -C' 



.2'« V A ni/2 ) 



since C > 8W^. 

So with probabihty at least 1 — A, we have ^n = under iJo- 

4-3.2. Alternative hypothesis 

The sequence (t;);, and p„ verify the assumptions of Lemma 4.1 (see Equa- 
tion (4.3)). 

If Hi is verified, then 

B 



mjx^.(||n;^,(/)|b-^-r,)>0, 



see Lemma 4.1. So there exists Jo < ^ < j such that 

2^ 
By Lemma 4.2, we have with probability at least 1 — A that for this / 



\IiwAf)h>,^ + ri. 



; 1 , <■ 

T„{l)>\\nwJ\\l-2^-C'[c- 



2(^-+')/^ , .uJ^wJWl 



2'/4j 



/ B \ / B fl 2'/4 N / 1 2(J+0/2 

^(^-)(j^-/^)-;f^ 

> 



(^+V2) =t„(0^ 
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since C > sJ^. 

So with probability at least 1 — A, we have ^n = 1 under Hi. 

Conclusion on the test ^„ All the inequalities developed earlier are true 

for any / in Hq or Hi with constants depending only on s,t,B,A and the 

supremuni over / in Hq and Hi of the error of type one and two are bounded 

by A. Finally, the test ^„ of errors of type 1 and 2 bounded by A distinguishes 

_ t 
between Hq and Hi with condition p„ = 2BCn 21+172 fQj- ^ value C large 

enough (but depending only on B,d). This implies that: 

sup E/^„+ sup E/(1-*„)<2A. 

/GS(s,B) fet{t.B,p„) 

5. Proof of Theorem 3.2 

Let 1 > u > 0, s > t > 0, and j e N* such that 2^ « n^^^ . 

Step 1: Definition of a testing problem on some large set. Define the 
set 

Consider the sequence of coefficients indexed by a given a e / as 

(a) 

where a^ = „] „ . Consider the function associated to a'-"^ that we write f*^") 
and that wc define as 

00 

1=Jq k&Zi keZj 

Consider the testing problem 

Ho:f = vs. ifi :/ = /("),«€/. (5.1) 

Step 2: Quantity of interest. An observation in the white noise model is 
equivalent, by sufficiency considerations, to an observation of empirical coef- 
ficients: equivalently to having access to the process y("\ wc have access to 
the empirical coefficients {ai^k)i,k (where d;.fe — J i'l^kdY^"'^) and each of these 
coefficients are independent J\f{ai^k, ^/n). Let ^ be a test, i.e. some measurable 
function (according to the empirical coefficients) taking values in {0, 1}. 
We have for any 77 > 



Eo[*] 



sup E^(c) [1 - *] > Eo [*] + ^ E %<°' [1 - *] 



>Er 



1{* = 1}] + 1{* = 0}Z 



> (1 - r;)Pro(Z > 1 - 77), (5.2) 
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dP, 



where Z — jjt '^Za&iWi k dP" ' where rf-P; ^ is the density of a;,fc when the 

function generating the data is f^°'\ and d-P^j. is the density of ai^k when the 
function generating the data is (this holds since the {ai^k)i.k are independent). 
More precisely, we have since the {ai^k)i,k are independent M{ai,k, I/jt-), and 
since a? = 1, that 



1 x-^ -i-r exp(-^(xi k - Ci)kY) 

z((xfc)fe)^z(K,),,)=-^n' 

' ' a£l Lk 



exp(-fx2,) 



TJtXI n exp(na;fe4"^)exp(--(4"')2), 



where {xk)k = {xj^k)k and [a^ )k = (ifc )fc- In the rest of the proof, we write 
also (ak)k = (pLj^k)k in order to simplify notations. 
By Markov and Cauchy Schwarz's inequality 



Pro(Z>l-,)>l-^^^^^>l-^^M^Hil!. (5.3) 

Step 3: Study of the term in Z. We have by definition of Z 

Eo[(2-l)'] 






><:p(~77(3;fc)^)da:i...a;23 



l,...X^j I I j^gj ^ 



exp(--(a;fe) )da::i...a;23 
2n7r ^ 



— V]Texp(a;fcna^°'')exp(--(a^"^)^)]T— =^exp(--(a::fc)^)da;i...a::2J +1 

/ (^En«-p(-'=-4"^)'^xp(-|(ai"V))'n^ 

2|77X]n/ ^^^exp(-^(a;fe-afc)^)da::fc + l 
1^1 ae/ fc -^--^ V27i7r 2 

/ (T7TX]n*^''p(^'='^"fc"^)^''p("^("fe"')^)) n^^^®''p("?(^'^) 



']dx\...X'^ 



by Fubini-Tonelli. This implies by developing the first term that 

Eo[(^-l)'] 



jwiT. f n«-p(^^-"(4"' + "i'^''))«-p(-?(("i"')' + (4"'^)'))^exp(-5(^o')d^i...rf-2.) 

' ' a,Q'e/"'==l---'^2J fc V2727r 

m^( E n/ exp(:c.7.(4")+ai"''))exp(-f((ai'^')^ + (4"'))^))^exp(-5(x.)^)d^-..)-l 
— — ( V] TT / exp(a;fe?Twa(afc +a'j.))exp(-ni;^a^)^^=exp(--(a;fc)^)da;fej - 1. 
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This implies by integrating depending on the respective values of au and a^ 
that 



1 



/■ 1 n 

+ exp(nu^a^)l{afc = a'fc = ~1} / ^ exp(--(a;fc + 2wa)^)(ia;/c 

Jai, v2n7r ^ 



exp(--(xfc - 2vaf)dxu 
zriTT ^ 



^^cxp(--4)da;fc 
2n7r ^ 



(5.4) 



a,a'Gl k 



Since the a and a' take respectively all possible values in { — 1, 1}^ , by definition 
of the expectation, and by replacing a and a' by R and R' in the formula, we 
have 



Tm Yl [■] ^^^C^O-.C^?^).!-]' 



Q,a'G/ 



where the {Ri)i, {R'j)j are two sequences of i.i.d. Rademacher random vari- 
ables that are also independent of each other. This implies together with Equa- 
tion (5.4) that 



Eo[{Z-l)^] =E(;,, 



>(«■). 



n(exp(- 

. fe 



^)(1 - l{Rk ^ R'k}) + exp{nv\^)l{Rk ^ <} 



- 1 



= n^«^'«'. [cxp(-nu2a2)(l - 1{R^ ^ R'^}) + cMnv^a^)HRk ^ K}) 

k 

since all i?^, R'^. are independent of each other. Moreover l{Rk ^ R'^] is a 
Bernouilli random variable of parameter 1/2 (since the two Rademacher are 
independent), which implies 



Eo[(Z- 1)^] = J|EB[exp(-nu2a2)(l - B) + cyi^{nv^a^)B 



1 



Eb cxp(-nu^a^)(l - B) + exp(7it;^a^)S 



-1, 
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where Eb[-] is the expectation according to a BernouiUi random variable with 
parameter 1/2. The last equation implies 

/I - m^a^ + {nv'^a^Y + 1 + nv'^a^ + {nv'^a^Y \ 2' 

since for any |u| < 1. wc have exp(u) < \ + u + v? . Since a? ~ ^^^ , we have 

Eo[(^-lf]<(l + ^)''-l 

<(exp(^))''-l=exp(^;4)_i 

< 1 + 2v^ - 1 = 2v^, 

since for any < m < 1, we have 1 + u < exp(M) < 1 + 2m. 

Step 4: Conclusion on the testing problem 5.1. By combining this with 
Equations (5.2), (5.3), we know that for n large enough 

Eo[*] + sup Ey(„) [!-*]>!- 2u'*, 

and since this holds with any ^, we have 

' > 1 - 2^;^ 



inf 



Eo[*]+ sup E^(„)[l-*] 

/(°),aG/ 



and this implies that there is no 1 — 2v^ consistent test for test 5.1 (and it holds 
for any < w < 1). 

Step 5: Translation of this result in terms of the test 1.1. Let v < B. 

Set 



vn 2t+i/2 
Pn = ;; 



By definition. 



|/^"'|lM,oo=.E(4"^)222j-*=t;<i?, 



so /(") e i](i,B). 

Also since only the j — th first coefiicients of / are non-zero, and since the 
constraints defining the minimisation over ^,{8,3) do not interact across the 
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levels (sec the proof of Lemma 4.1) we have Va G /, and for all n large enough 



f(")|l„ - R9-J^ 



geT,{s,B) 



>mw,r'h-BT 



vn 2'+i/2 

> 



by triangular inequality and since for any g e I](s,i?), ||nn/.(g)||2 < Sn 2t+i/2 < 
^7^,24+1/2 for n large enough, since s > t. This together with the fact that 
/(") G T.(t,B) implies that Va € /,/<") G t{t,B,pn). 

We know that G i;(s, B), and that Va, /"^"^ G S(t, B, p„) (by the previous 
equations). This implies that the testing problem 5.1 is a strictly easier problem 
than the testing problem 1.1, i.e. that 



inf 



/("),«£/ 



* 



Eo[*]+ sup Ey(o)[l-*] < inf sup E/[*]+ sup E/[l-*] 



/eE(s,B) fe-s{t.B,p„ 



Wc know that there is no 1 — 2v'^ consistent test for the test 5.1 and hence, 
there is no 1 — 2v* consistent test for test 1.1 (and it holds for any < u < 1). 
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